Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract)
نویسندگان
چکیده
To handle a large amount of unlabeled data, batch active learning (BAL) queries humans for the labels most valuable data points at every round. Most current BAL strategies are based on human-designed heuristics, such as uncertainty sampling or mutual information maximization. However, there exists disagreement between these heuristics and ultimate goal BAL, i.e., optimizing model's final performance within query budgets. This leads to limited generality heuristics. this end, we formulate an MDP propose data-driven approach deep reinforcement learning. Our method learns strategy by maximizing performance. Experiments UCI benchmark show that our can achieve competitive compared existing heuristics-based approaches.
منابع مشابه
Deep Abstract Q-Networks
We examine the problem of learning and planning on highdimensional domains with long horizons and sparse rewards. Recent approaches have shown great successes in many Atari 2600 domains. However, domains with long horizons and sparse rewards, such as Montezuma’s Revenge and Venture, remain challenging for existing methods. Methods using abstraction (Dietterich 2000; Sutton, Precup, and Singh 19...
متن کاملBatch Active Learning via Coordinated Matching
We propose a novel batch active learning method that leverages the availability of high-quality and efficient sequential active-learning policies by approximating their behavior when applied for k steps. Specifically, our algorithm uses MonteCarlo simulation to estimate the distribution of unlabeled examples selected by a sequential policy over k steps. The algorithm then selects k examples tha...
متن کاملDeep Q-Learning With Recurrent Neural Networks
Deep reinforcement learning models have proven to be successful at learning control policies image inputs. They have, however, struggled with learning policies that require longer term information. Recurrent neural network architectures have been used in tasks dealing with longer term dependencies between data points. We investigate these architectures to overcome the difficulties arising from ...
متن کاملDeep Learning Quadcopter Control via Risk-Aware Active Learning
Modern optimization-based approaches to control increasingly allow automatic generation of complex behavior from only a model and an objective. Recent years has seen growing interest in fast solvers to also allow real-time operation on robots, but the computational cost of such trajectory optimization remains prohibitive for many applications. In this paper we examine a novel deep neural networ...
متن کاملDistributed Deep Q-Learning
We propose a distributed deep learning model to successfully learn control policies directly from highdimensional sensory input using reinforcement learning. The model is based on the deep Q-network, a convolutional neural network trained with a variant of Q-learning. Its input is raw pixels and its output is a value function estimating future rewards from taking an action given a system state....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i13.26989